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LOCUS ATF18022 100269 bp DNA linear PLN 13-APR-2000 

DEFINITION Arabidopsis thaliana DNA chromosome 5, BAC clone F 18022 (ESS A 

project). 
ACCESSION AL163817 
VERSION AL163817.1 GL7573446 
KEYWORDS . 

SOURCE Arabidopsis thaliana (thale cress) 
ORGANISM Arabidopsis thaliana 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 
Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; 
Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis. 
REFERENCE 1 (bases 1 to 100269) 
AUTHORS Bevan,M., Murphy,G., Ridley,P., Hudson.S., Bancroft,L, Mewes,H.W., 

Rudd,S., Lemcke,K. and Mayer,K.F.X. 
JOURNAL Unpublished 
REFERENCE 2 (bases 1 to 100269) 
AUTHORS EU Arabidopsis sequencing,proj ect. 
TITLE Direct Submission 

JOURNAL Submitted (13-APR-2000) MIPS, at the Max-Planck-Institut fuer 
Biochemie, Am Klopferspitz 18a, D-82152 Martinsried, FRG, E-mail: 
lemcke@mips.biochem.mpg.de,mayer@mips.biochem.mpg.de Project 
Coordinator: Mike Bevan, Molecular Genetics Department, Cambridge 
Laboratory, John Innes Centre, Colney Lane, NR4 7UJ Norwich, UK, 
E-mail: michael.bevan@bbsrc.ac.uk 
COMMENT Information on performance of analysis and a more detailed 
annotation of this entry and other sequences of chromosomes 3, 4 
and 5 can be viewed at: http://www.mips.biochem.mpg.de/proj/thal/ 
FEATURES Location/Qualifiers " 

source 1.. 100269 

/organism- ' Arabidopsis thaliana" 
/variety="Columbia" 
/db_xref="taxon:3 702" 
/chromosome— ' 5 " 

join(62268..62359,62808..63132,63210..63402,63686..63747, 

63961. .64047,64215..64293,64403..64509,64678..64778 
65030..65132) 

/gene="F18O22_190" 

/note="similarity to cytochrome P450 CYP90, Arabidopsis 

thaliana, PJJR..S55379" 

/codon_start=l 

/product- 'putative protein" 

/protein_id="CAB87779.1 " 

/db_xref="GI: 75 73465" 

/translation="MGWPFIGETISFFKPHRSDSIGTFLQQRVSRYGKVFKSNICGGK 



AWSCDQELNMFILQNEGKLFTSDYPICAMHDILGKYSLLLATGEIHRKLKNVIISF 
IN 

LTKSKPDFLHCAENLSISILKSWKNCREVEFHKEVKIFTLSVMVNQLLSIKPEDPA 
RL 

YVLQDFLSYMKGFISLPIPLPGTGYTNAIKVRSNRNIHQNAIffiDMNNAIREEDFL 
DS 

IISNEDEEHAAIRAKKGDGELLNWEDYQKMEFTQCVISEALRCGNIVKTVHRKAT 
HDI 



KFNEYVIPKGWKVFPIFTAVHLDPSLHENPFEFNPMRWTKTTAFGGGVRVCPGG 
ELGK 

LQIAFFLHHLVLSYRWKDCSDEMPIAHPYVEFKRGMLLEIEPTKFLED" 
exon 62268..62359 

/gene="F18O22_190" 

/number=l 
intron 62360..62807 

/gene="F18O22_190" 

/number=l 
exon 62808..63132 

/gene="F18O22_190" 

/number=2 
intron 63 133. .63209 

/gene="F18O22_190" 

/number=2 
exon 63210..63402 

/gene="F18O22_190" 

/number=3 
intron 63403. .63685 

/gene="F18O22_190" 

/number=3 
exon 63686..63747 

/gene="F18O22_190" 

/number=4 
intron 63748..63960 

/gene="F18O22_190" 

/number=4 
exon 63961. .64047 

/gene="F18O22_190" 

/number=5 
intron 64048..64214 

/gene="F18O22_190" 



f - 



/number=5 
exon 642 15.. 64293 

/gene="F18O22_190" 

/number=6 
intron 64294..64402 

/gene="F18O22_190" 

/number=6 
exon 64403..64509 

/gene="F18O22_190" 

/number=7 
intron 64510..64677 

/gene="F18O22_190" 

/number=7 
exon 64678..64778 

/gene="F18O22_190" 

/number=8 
intron 64779..65029 

/gene="F18O22_190" 

/number=8 
exon 65030..65132 

/gene="F18O22_190" 

/number=9 
gene 65647..66148 

/gene="F18O22_200" 
gene complement(j oin(65647. .6594 1 ,66042. .66 148)) 
62221 ctaagaaaaa tgaaacaaac gatcatcact catcactaac tggcagcatg ggatggcctt 
62281 tcattggaga aactatttct ttcttcaaac ctcatagatc agactccatc ggtacattct 
62341 tgcaacaacg tgtttcacgg taataattaa catgttcttt ttatttcttt tgttgttgtt 
62401 gtcaattgta gtgcgaaatt taaattgtgt agttagcaac aacaaaaaaa ctgtgtagac 
62461 aacaatatac acccttcatg ggaactcagt tttcaaaaaa taattgaacc aaattggctt 
62521 tcagcaaaaa acgtattagc cacttaatgc ctaaacacca aataacctat tttctttaaa 
62581 ggatcattat atttttatat ataattaatt aaaagaaaaa aagtaatgtt cacagtacta 
62641 cagaagattc atatctgatt tttttgcata catgacaatt tgattgccta ccaatgtttt 
62701 tatgatttta ttttgtaaat ccaatttata tactatacaa catataatat atcttttttt 
62761 gtttgtttaa gcctaaattg acaataactg tgaatgttgt tattcaggta tggaaaagtg 
62821 ttcaagtcaa atatatgtgg tggaaaagca gtagtctcat gtgaccaaga actcaacatg 
62881 ttcatacttc aaaacgaagg gaagttgttt acatcggatt atccaaaagc gatgcatgac 
62941 attctcggca aatattccct tctattagcc accggagaaa ttcacaggaa actaaaaaat 
63001 gttattatta gcttcatcaa tctcacaaag tcgaaacctg actttcttca ctgcgcagag 
63061 aacctctcta tctcgatact aaagtcatgg aaaaattgcc gagaagtcga attccataaa 
63121 gaagttaaaa tagtaattaa ttaatcactg aatccttata atcaatattg ttataattct 
63181 gatatctttg ttgcggttga atcatacagt ttactctcag tgttatggta aaccaactct 
63241 tgagcatcaa gccagaagac ccagcaagac tttatgtatt gcaagatttt ttatcttata 
63301 tgaaagggtt tatctcctta ccaataccgc ttccaggaac gggttataca aacgcaatta 
63361 aggttagatc caatcgtaat atacatcaaa acgcaattat aggttttgtt ttcgtcgttt 
63421 tatatcaaaa cattaagcca ctaatttata agaaaataaa caattggatt ctaggttttt 
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gcagttcttg tatataactt gggtcatgat gatgctttaa tttatcatat ggttttcatg 
taaacacatt tatctgtatg taaatacata acatatatac acttaaaact atatagtttt 
aaagtcattt tcgatgtttg ccaggctagg aagagattgt cggcgagggt tatggggatg 
ataaaagaaa gagagcgcga agaagaagac atgaataatg caataagaga agaagatttt 
ctggattcga taatttcgaa tgaagatcta aattatgaag agaaagttag cattgtgttg 
gacattttgc ttggaggctt tgagacaagt gctactactc tttccttagt cgtctatttt 
ctagcaaaat ctccaaatct tcttcacaaa ctcaaggtat tctctctctc ttctcattat 
ctatatattt gtgcatgtat acacatacgt atatatgtta atctatgtga acgtatacag 
gaagaacatg cagccattag agccaagaaa ggggatgggg aacttttgaa ttgggaagat 
tatcagaaga tggaattcac tcaatgtgta tgtttcataa acccatccca tcattttatt 
ttatttttgc tattttttcg caaatttgtt ttggtacgta gccaagtttt tttctgatca 
tatgttgata ctactttaaa atcatataga acatgttcac ttatatcatg aaatttactg 
gacggaattt gcaggtgatt tctgaggcac tacgatgtgg taatatcgtc aagactgtac 
atagaaaagc tactcatgat attaaattca acggttaaca aaaatgtcaa tcattttttt 
ttttgatcaa aaaatgtcaa tcaaatttta ctaattaatg tagaaatcaa attagtgttt 
ctctaatatg tttgagttgc agaatatgtg attccaaagg ggtggaaggt gtttccaatc 
ttcacagcag tacatcttga tccctctctt catgaaaatc cttttgaatt taatcccatg 
agatggaccg taagtaaatt atttagaaac agataacatt caaatgggtt tttttttttt 
tttttttttt tttttgtcaa aaataacatt caaatggtaa gcaaagaaaa aaaacggaaa 
tattgtgaaa ccaattttgg tttttaaatt aaataggata aggctaagat gaacaagaaa 
acgacggcgt ttggaggagg agtaagggta tgtcctggtg gtgaacttgg caagctccaa 
attgctttct tccttcatca tcttgtcctc tcctataggt ttgtctaatc actcactatg 
gttgactaat tttaattagt gaacactgcc ttatccgaca taatcatcta ttatactttg 
gttaaaattt tggttggttt gatttttaaa tcgttttgtt ttgtttattt tcttctcaag 
ttctcatgtt tcagttattt tatactgatg gatacatcta tttgaaaaaa tatgtgtttt 
atcatgtttt aatgttcaat atagctaaga agtttattta taatggtagg tggaaaataa 
agtcagatga aatgccaatc gcgcaccctt acgtggagtt taagagaggc atgcttttgg 
agatagagcc aacaaaattc cttgaagatt agctgcttta ataaggaatc catcttagat 



62268..65132 

/gene="F18O22_190" 
CDS join(62268..62359,62808..631 32,632 10..63402.63686..63 747, 

63961.. 64047,64215. .64293,64403..64509,64678..64778, 
65030..65132) 
/gene="F18O22_190" 

/note-'similarity to cytochrome P450 CYP90, Arabidopsis 

thaliana, PIR:S55379" ' 

/codon_start=l 

/product="putative protein". 

/protein id=" CAB87779.1 " 

/db_xref= ,, GI:7573465" 

/db_xref="GOA:Q9LY89" 

/db_xref="SPTREMBL:Q9LY89" 

/translation="MGWPFIGETISFFKPHRSDSIGTFLQQRVSRYGKWKSNICGGK 

AWSCDQELNMFILQNEGKLFTSDWKAMHDILGKYSLLLATGEIHRKLKNVnSF 

INLTKSKPDFLHCAENLSISILKSWKNCREVEFHK^VKIFTLSVMVNQLLSIKPEDP 

APvLWLQDFLSYMKGFISLPIPLPGTGYTNAIKWSNRNIHQNAIIEDMNNAIREE 

DFLDSnSNEDEEHAAIRAKKGDGELLNWEDYQKMEFTQCVISEALRCGNIVKTV 

HPvKATHDIKFNEYVIPKGWKVFPIFTAVHLDPSLHENPFEFNPMRWTKTTAFGGG 

VRVCPGGELGKLQIAFFLHHLVLSYRWKIKSDEMPIAHPYVEFKRGMLLEIEPTK 

FLED" 



exon 62268..62359 

/gene="F18O22_190" 

/number=l 
intron 62360.. 62807 

/gene="F18O22_190" 

/number=l 
exon 62808..63132 

/gene="F18O22_190" 

/number=2 
intron 63133..63209 

/gene="F18O22_190" 

/number=2 
exon 63210..63402 

/gene="F18O22_190" 

/number=3 
intron 63403..63685 

/gene="F18O22_190" 

/number=3 
exon 63686..63747 

/gene="F18O22_190" 

/number=4 
intron 63748..63960 



/gene="F18022 
/number=4 



190 



exon 



63961.. 64047 
/gene="F18022 
/number=5 



190' 



intron 



64048.. 642 14 
/gene="F 18022 
/number=5 



190 



exon 



64215..64293 



/gene="F18O22_190' 

/number=6 
intron 64294.. 64402 

/gene="F18O22_190' 

/number=6 
exon 64403..64509 

/gene="F18O22_190' 

/number=7 
intron 645 10.. 64677 

/gene="F18O22_190' 

/number=7 
exon 64678..64778 

/gene-"F18O22_190" 

/number=8 
intron 64779..65029 

/gene="F18O22_190" 

/number=8 
exon 65030..65132 

/gene="F18O22_190" 

/number=9 



62221 ctaagaaaaa tgaaacaaac gatcatcact catcactaac tggcagcatg ggatggcc 
62281 tcattggaga aactatttct ttcttcaaac ctcatagatc agactccatc ggtacattct 
62341 tgcaacaacg tgtttcacgg taataattaa catgttcttt ttatttcttt tgttgttgtt 
62401 gtcaattgta gtgcgaaatt taaattgtgt agttagcaac aacaaaaaaa ctgtgtagac 
62461 aacaatatac acccttcatg ggaactcagt tttcaaaaaa taattgaacc aaattggctt 
62521 tcagcaaaaa acgtattagc cacttaatgc ctaaacacca aataacctat tttctttaaa 
6258 1 ggatcattat atttttatat ataattaatt aaaagaaaaa aagtaatgtt cacagtacta 
62641 cagaagattc atatctgatt tttttgcata catgacaatt tgattgccta ccaatgtttt 
62701 tatgatttta ttttgtaaat ccaatttata tactatacaa catataatat atcttttttt 
62761 gtttgtttaa gcctaaattg acaataactg tgaatgttgt tattcaggta tggaaaagtg 
62821 ttcaagtcaa atatatgtgg tggaaaagca gtagtctcat gtgaccaaga actcaacatg 
62881 ttcatacttc aaaacgaagg gaagttgttt acatcggatt atccaaaagc gatgcatgac 
62941 attctcggca aatattccct tctattagcc accggagaaa ttcacaggaa actaaaaaat 
63001 gttattatta gcttcatcaa tctcacaaag tcgaaacctg actttcttca ctgcgcagag 
63061 aacctctcta tctcgatact aaagtcatgg aaaaattgcc gagaagtcga attccataaa 



63121 gaagttaaaa tagtaattaa ttaatcactg aatccttata atcaatattg ttataattct 
63181 gatatctttg ttgcggttga atcatacagt ttactctcag tgttatggta aaccaactct 
6324 1 tgagcatcaa gccagaagac ccagcaagac tttatgtatt gcaagatttt ttatcttata 
63301 tgaaagggtt tatctcctta ccaataccgc ttccaggaac gggttataca aacgcaatta 
63361 aggttagatc caatcgtaat atacatcaaa acgcaattat aggttttgtt ttcgtcgttt 
63421 tatatcaaaa cattaagcca ctaatttata agaaaataaa caattggatt ctaggttttt 
63481 gcagttcttg tatataactt gggtcatgat gatgctttaa tttatcatat ggttttcatg 
63541 taaacacatt tatctgtatg taaatacata acatatatac acttaaaact atatagtttt 
63601 aaagtcattt tcgatgtttg ccaggctagg aagagattgt cggcgagggt tatggggatg 
63661 ataaaagaaa gagagcgcga agaagaagac atgaataatg caataagaga agaagatttt 
63721 ctggattcga taatttcgaa tgaagatcta aattatgaag agaaagttag cattgtgttg 
63781 gacattttgc ttggaggctt tgagacaagt gctactactc tttccttagt cgtctatttt 
63841 ctagcaaaat ctccaaatct tcttcacaaa ctcaaggtat tctctctctc ttctcattat 
63901 ctatatattt gtgcatgtat acacatacgt atatatgtta atctatgtga acgtatacag 
63961 gaagaacatg cagccattag agccaagaaa ggggatgggg aacttttgaa ttgggaagat 
64021 tatcagaaga tggaattcac tcaatgtgta tgtttcataa acccatccca tcattttatt 
64081 ttatttttgc tattttttcg caaatttgtt ttggtacgta gccaagtttt tttctgatca 
64141 tatgttgata ctactttaaa atcatataga acatgttcac ttatatcatg aaatttactg 
64201 gacggaattt gcaggtgatt tctgaggcac tacgatgtgg taatatcgtc aagactgtac 
64261 atagaaaagc tactcatgat attaaattca acggttaaca aaaatgtcaa tcattttttt 
64321 ttttgatcaa aaaatgtcaa tcaaatttta ctaattaatg tagaaatcaa attagtgttt 
64381 ctctaatatg tttgagttgc agaatatgtg attccaaagg ggtggaaggt gtttccaatc 
64441 ttcacagcag tacatcttga tccctctctt catgaaaatc cttttgaatt taatcccatg 
64501 agatggaccg taagtaaatt atttagaaac agataacatt caaatgggtt tttttttttt 
64561 tttttttttt tttttgtcaa aaataacatt caaatggtaa gcaaagaaaa aaaacggaaa 
64621 tattgtgaaa ccaattttgg tttttaaatt aaataggata aggctaagat gaacaagaaa 
64681 acgacggcgt ttggaggagg agtaagggta tgtcctggtg gtgaacttgg caagctccaa 
64741 attgctttct tccttcatca tcttgtcctc tcctataggt ttgtctaatc actcactatg 
64801 gttgactaat tttaattagt gaacactgcc ttatccgaca taatcatcta ttatactttg 
64861 gttaaaattt tggttggttt gatttttaaa tcgttttgtt ttgtttattt tcttctcaag 
64921 ttctcatgtt tcagttattt tatactgatg gatacatcta tttgaaaaaa tatgtgtttt 
6498 1 atcatgtttt aatgttcaat atagctaaga agtttattta taatggtagg tggaaaataa 
65041 agtcagatga aatgccaatc gcgcaccctt acgtggagtt taagagaggc atgcttttgg 
65101 agatagagcc aacaaaattc cttgaagatt agctgcttta ataaggaatc catcttagat 
65161 gaaaatttaa ttaataacta ttaaaaatgt aatgtctaat aatatgctcc attcccaaat 
65221 ataaagtact ctctagtcat agagttttgt gttagttagg tgttggagtt ttgaaataaa 
65281 atgagtattt gcctttggat atggtatatt tatgttttgt ttctagaaaa attatctaaa 
65341 atattactat ttcgagtttt tatatcatat ccgcgttcaa atatttaatt agtgttacta 
65401 ggcaatctaa cacaatactt ttaaaataac tcaaatcatg tgtatactat aaagtagtta 



CAB87779. putative protein ...[gi: 75 73465] 

LOCUS CAB87779 382 aa linear PLN 13-APR-2000 

DEFINITION putative protein [Arabidopsis thaliana]. 

ACCESSION CAB87779 

VERSION CAB87779.1 Gl:7573465 

DBSOURCE embl locus ATF18022, accession AL1 6381 7.1 

KEYWORDS . 

SOURCE Arabidopsis thaliana (thale cress) 
ORGANISM Arabidopsis thaliana 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 

Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; 

rosids; eurosids II; Brassicales; Brassicaceae; Arabidopsis. 
REFERENCE 1 (residues 1 to 382) 

AUTHORS Bevan.M., Murphy.G., Ridley.P., Hudson.S., Bancroft,!., Mewes.H.W., 

Rudd.S., Lemcke.K. and Mayer.K.F.X. 
JOURNAL Unpublished 
REFERENCE 2 (residues 1 to 382) 
AUTHORS EU Arabidopsis sequencing, project. 
TITLE Direct Submission 

JOURNAL Submitted (1 3-APR-2000) MIPS, at the Max-Planck-lnstitut fuer 
Biochemie, Am Klopferspitz 18a, D-82152 Martinsried, FRG, E-mail: 
lemcke@mips.biochem.mpg.de,mayer@mips.biochem.mpg.de Project 
Coordinator: Mike Bevan, Molecular Genetics Department, Cambridge 
Laboratory, John Innes Centre, Colney Lane, NR4 7UJ Norwich, UK, 
E-mail: michael.bevan@bbsrc.ac.uk 
COMMENT Information on performance of analysis and a more detailed 
annotation of this entry and other sequences of chromosomes 3, 4 
and 5 can be viewed at: http://www.mips.biochem.mpg.de/proj/thal/ . 
FEATURES Location/Qualifiers 
source 1..382 

/organism="Arabidopsis thaliana" 
/variety-'Columbia" 
/db_xref="taxon:3702" 
/chromosome="5" 
Protein 1..382 

/product-'putative protein" 



CDS 1..382 

/gene="F18O22_190" 

/coded_by="join(AL1 6381 7. 1 :62268.. 62359, 

AL1 6381 7. 1 :62808..631 32.AL1 6381 7. 1 :6321 0..63402, 

AL1 6381 7. 1 :63686..63747,AL1 6381 7. 1 :63961 ..64047, 

AL163817.1:64215..64293,AL163817.1:64403..64509, 

AL1 63817. 1 :64678..64778,AL1 6381 7. 1 :65030..651 32)" 

/note="similarity to cytochrome P450 CYP90, Arabidopsis 

thaliana, PIR:S55379" 

/db_xref="GOA:Q9LY89" 

/db_xref="SPTREMBL:Q9LY89" 

ORIGIN 

1 mgwpfigeti sffkphrsds igtflqqrvs rygkvfksni cggkavvscd qelnmfilqn 
61 egklftsdyp kamhdilgky slllatgeih rklknviisf inltkskpdf Ihcaenlsis 
121 ilkswkncre vefhkevkif tlsvmvnqll sikpedparl yvlqdflsym kgfislpipl 
181 pgtgytnaik vrsnrnihqn aiiedmnnai reedfldsii snedeehaai rakkgdgell 
241 nwedyqkmef tqcvisealr cgnivktvhr kathdikfne yvipkgwkvf piftavhldp 
301 slhenpfefn pmrwtkttaf gggvrvcpgg elgklqiaff Ihhlvlsyrw kiksdempia 
361 hpyvefkrgm lleieptkfl ed 



